Auditory Scene Analysis: Computational Models
نویسنده
چکیده
Listeners have to make sense of a complex acoustic world containing overlapping sound sources that must be organized into individual auditory objects. Computational auditory scene analysis concerns the use of algorithms inspired by human sound perception whose aim is to extract properties of constituent sound sources in a complexmixture. Starting with representations based on models of how sound is processed in the peripheral auditory system, typical computational auditory scene analysis techniques function by decomposing the mixture into components followed by selective recomposition into groups of components that appear to emanate from a single source. Grouping processes can be informed by information in the signal itself or by the use of prior statistical models of sound sources. This article outlines some of the principal signal decompositions used in models of auditory grouping and goes on to describe a decoder that combines both signaland modeldriven grouping processes.
منابع مشابه
The auditory organization of speech and other sources in listeners and computational models
Speech is typically perceived against a background of other sounds. Listeners are adept at extracting target sources from the acoustic mixture reaching the ears. The auditory scene analysis account holds that this feat is the result of a two stage process. In the first stage, sound is decomposed both within and across auditory nuclei. Subsequent processes of perceptual organisation are informed...
متن کاملTitle : The auditory organization of speech and other sources in listeners and computational models
Speech is typically perceived against a background of other sounds. Listeners are adept at extracting target sources from the acoustic mixture reaching the ears. The auditory scene analysis account holds that this feat is the result of a two stage process: In the first stage sound is decomposed into collections of fragments in several dimensions. Subsequent processes of perceptual organization ...
متن کاملConnectionist Models for Auditory Scene Analysis
Although the visual and auditory systems share the same basic tasks of informing an organism about its environment, most connectionist work on hearing to date has been devoted to the very different problem of speech recognition . VVe believe that the most fundamental task of the auditory system is the analysis of acoustic signals into components corresponding to individual sound sources, which ...
متن کاملBregman's Chimerae: Music Perception as Auditory Scene Analysis
Research into the perception and cognition of music listening often contains implicit assumptions about the nature of the underlying mental representations, and about the relationship between "auditory processing" and "music perception". We attempt to highlight and problemitize some of these assumptions and to provide a more cognitively appropriate model for music perception and cognition, base...
متن کاملUnderconstrained Stochastic Representations for Top-down Computational Auditory Scene Analysis
Since Bregman published his unifying account of psychological results in auditory organization, Auditory Scene Analysis [1], there has been a series computational models of these principles. The dominant approach, as embodied in the dissertations of Cooke [2], Mellinger [3] and Brown [4], and elsewhere [5], may be characterized as follows: First the sound is processed by a conventional signalpr...
متن کامل